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1 TITLE 
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3 PRIMARY/BACKUP STRUCTURE 

4 

5 CLAIM OF PRIORITY 

6 [0001] This application claims priority to an application entitled "SYSTEM COMBINED WITH 

7 LOADSHARING STRUCTURE AND PRIMARY/BACKUP STRUCTURE", filed in the 

8 Korean Intellectual Property Office on October 15, 2002 and assigned Serial No. 2002-62842, 

9 the contents of which are hereby incorporated by reference. 
10 

1 1 BACKGROUND OF THE INVENTION 

12 1. Field of the Invention 

13 [0002] The present invention relates to a distributed control system, and more particularly, to a 

14 system combined with a load sharing structure and a primary/backup structure for efficiently and 

1 5 stably operating the system. 
16 

17 2. Description of the Related Art 

18 [0003] Generally, systems for distributively controlling many physically-distributed systems are 

19 divided into either a dual system (hereinafter, primary/backup structure) or a load sharing 

20 system. Systems that are primary/backup systems have resources and measures in place when a 

21 primary unit malfunctions. If a primary unit malfunctions in a primary/backup structure, the 
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;l system can complete the processing of an event by using a backup unit. In such primary/backup 

,2 systems, the backup unit contains all the information (is a duplicate) of the primary unit. 

3 [0004] Systems that are load sharing have a plurality of units that can process events. The work 

4 in processing the events is distributed among the units so that to alleviate congestion that would 

5 result if only one unit is used. 

6 [0005] However, primary/backup systems are inefficient because every piece of data must be 

7 duplicated into the backup unit. This requires a lot of overhead and resources to duplicate 

8 everything. Load sharing systems do not have primary/backup capabilities and systems with 

9 primary/backup capabilities do not have load sharing capabilities. 

10 [0006] The above problems will be described as follows. The primary/backup structure does not 

1 1 have an efficient load sharing system and does not use all available system resources, thereby 

12 causing a waste of resources and deteriorating system efficiency. The primaryftackup structure 

13 must be always increased in pairs, one for the primary unit and the other for the backup unit, 

14 causing an increase of cost. 

15 [0007] The load sharing structure is advantageous in that it uses system resources efficiently by 

16 distributing processing of events in a uniform way among the units or sub-systems. However, 

17 this advantage is realized only when all systems are normally operated. If a certain system 

18 malfunctions in a load sharing structure, it becomes impossible to continue processing events in 

19 the down system, and furthermore, shared data occupied to be used cannot be restored. As a 

20 result, the load sharing system becomes unreliable because the load sharing system lacks backup 

2 1 and restoration capabilities. 
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. i SUMMARY OF THE INVENTION 

. 2 [0008] It is therefore an object to provide an improved distributed control system. 

3 [0009] It is another object of the present invention to provide a distributed control system with 

4 improved system capacity while reducing or eliminating waste of resources. 

5 [0010] It is also an object of the present invention to provide a distributed control system that has 

6 both primaryfaackup capabilities and load sharing capabilities in a single system. 

7 [0011] It is also an object of the present invention to provide a distributed control system with a 

8 combined load sharing structure and a primary/backup structure, where only data necessary for 

9 restoring shared resources and event processing is duplicated, reducing the resources for backup 

10 and thus achieving a more stable and efficient system. 

1 1 [0012] These and other object may be achieved by a system having many sub-systems, each sub- 

12 system having a primary unit to share the event processing load according to load sharing 

13 processing order for events. Each sub-system also has a backup unit to receive and store as little 

14 data as necessary for restoration of a primary unit, the data being stored in the backup unit of a 

15 predetermined sub-system. The system further has a configuration management unit designating 

16 which backup unit is for each primary unit as well as managing which backup unit is for which 

17 primary unit. The system fiirther has a distributed algorithm processing unit determining which 

18 sub-system processes events when the events are generated. The system also has a shared 

19 resource which is a resource accessible, shared and used by each sub-system and occupied in a 

20 primary units, the shared resource manages the processing of generated events. The system 

21 further includes a distributed control environment composed of a middleware platform enabling 
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■l distributed processing among each sub-system, the configuration management unit, the 

. 2 distributed algorithm processing unit, and the shared resource. 
3 

4 BRIEF DESCRIPTION OF THE DRAWINGS 

5 [0013] A more complete appreciation of the invention, and many of the attendant advantages 

6 thereof, will be readily apparent as the same becomes better understood by reference to the 

7 following detailed description when considered in conjunction with the accompanying drawings 

8 in which like reference symbols indicate the same or similar components, wherein: 

9 [0014] FIG. 1A is a conceptual diagram illustrating a distributed control system with a 

10 primary/backup structure between different sub-systems; 

n [0015] FIG. IB is a conceptual diagram illustrating a distributed control system with a load 

12 sharing structure between different sub-systems; 

13 [0016] FIG. 2A is a conceptual diagram illustrating a distributed control system with both a load 

14 sharing structure and a primary/backup structure in accordance with one embodiment of the 

15 present invention; 

16 [0017] FIG. 2B is a block diagram illustrating a distributed control system with both a load 

17 sharing and backup setting relations when events are generated in a system of FIG. 2 A; 

18 [0018] FIG. 3 A is a block diagram illustrating a process of a specific backup unit being changed 

19 to a specific primary unit according to the relations of FIG. 2B when a specific primary unit 

20 malfunctions; and 
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;1 [0019] FIG. 3B is a block diagram illustrating load sharing processes of remaining primary units 

. 2 when a specific primary is malfunctioning according to the relations illustrated in FIG. 2B. 

3 

4 DETAILED DESCRIPTION OF THE INVENTION 

5 [0020] Turning to the figures, FIG. 1A is a conceptual diagram illustrating a distributed control 

6 system 40 with a primary/backup structure between different sub-systems within the distributed 

7 control system, and FIG. IB is a conceptual diagram illustrating a distributed control system 50 

8 with a load sharing structure between different sub-systems in the distributed control system. 

9 Referring to FIG. 1A, the primary/backup structure performs event processing by primary unit 

10 1 1 in sub-system 10 and duplicating the contents of primary unit 1 1 into backup unit 22 in sub- 

11 system 20. If primary unit 11 should happen to malfunction, backup unit 22 will finish 

12 processing the event since backup unit 22 contains all the information of primary unit 1 1 prior to 

13 the malfunction. 

14 [0021] Thus, data of primary unit 1 1 and backup unit 22 are identical. If primary unit 1 1 

15 malfunctions, backup unit 22 takes over and becomes a new primary unit and processes the 

16 events waiting to be processed by primary unit 11, thus continuing normal operation on the 

17 processing of the events. However, in the primary/backup structure of FIG. 1 A, primary unit 1 1 

18 always processes all events regardless of the amount of the events waiting to be processed, and 

19 the sub-system 20 and backup unit 22 sits by idly in a standby mode while continually 

20 duplicating data pertaining to processed events found in primary unit 1 1 . In FIG. 1 A, because no 

21 resources other than primary unit 11 processes events, the distributed control system 40 of FIG. 
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;l 1 A can become bottlenecked and congested as no other resources are dedicated to processing the 

. 2 events. Meanwhile, backup unit 22 is a large resource that duplicates primary unit 1 1 but does 

3 not help in alleviating the bottleneck, resulting in wasted resources unless and until primary unit 

4 1 1 malfunctions. 

5 [0022] Turning to FIG. IB, FIG. IB illustrates a load sharing distributed control system 50 that 

6 uniformly distributes the work load among three primary units so that there is no longer the 

7 bottleneck problem of FIG. 1 A. In the load sharing system of FIG. IB, all three primary units 

8 1 1, 22 and 31 work together to process events. Since all three sub-systems 10, 20 and 30 contain 

9 only primary units with no backup units, when one or more of the primary units malfunctions, it 

10 becomes impossible to continue and complete the processing of events. 

1 1 (0023) Turning to FIG. 2A, FIG. 2A illustrates a distributed control system 60 having both a load 

12 sharing structure and a primaryftackup structure in accordance with the one embodiment of the 

13 present invention. System 60 has three primary units 1 10, 210 and 310 disposed in sub-systems 

14 100, 200 and 300 respectively. System 60 further has backup units 120, 220 and 320 disposed in 

15 sub-systems 100, 200 and 300 respectively. Backup units 120, 220 and 320 correspond to 

16 primary units 310, 1 10, and 210, respectively, as indicated by the arrows in FIG. 2 A. System 60 

17 further includes a distributed control environment 400 comprised of a middleware platform and 

18 being programmed and configured to distribute processing among the plurality of sub-systems, 

19 the configuration management unit, the distributed algorithm processing unit, and the shared 

20 resource unit., a Configuration Management unit (or CM unit) 500, a distributed algorithm 

21 processing unit 600, a logical shared resource unit 700, and an event generation unit 800. 
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;l [0024] Configuration management (CM) unit 500 generates the primary units 1 10, 210, and 310 

. 2 and configures the backup units 120, 220, and 320. If a new sub-system is added, a primary unit 

3 410 (not illustrated) is newly generated by the CM unit 500, and a new backup unit is 

4 reconfigured for the newly-generated primary unit 410 (not illustrated). CM unit 500 separately 

5 manages an index linking the backup units 120, 220, and 320 to the corresponding primary units 

6 310, 110 and 210 respectively. CM unit 500 manages only the primary units 110, 210, and 310 

7 and backup units 120, 220 and 320. CM unit 500 is not involved in generation of the events. 

8 [0025] The logical shared resource unit 700 manages processing of the generated events. When a 

9 new event shared by the primary units 1 10, 210 and 310 is generated, the logical shared resource 

10 unit 700 is occupied by the primary units 1 10, 210 and 310. If the primary/backup structure of 

1 1 FIG. 2A is constructed and the new event is generated in the shared state with the logical shared 

12 resource unit 700, a process for processing the event in the primary units 110, 210 and 310, a 

13 process for continuously updating the backup units 120, 220 and 320, and a process for restoring 

14 the resources are carried out. 

15 [0026] A primary designating method is determined by the distributed algorithm processing unit 

16 600 using 1) a round robin method for distributing and designating the events, one event for each 

17 primary unit 110, 210 and 310 and 2) a method of distributing the events toward a least 

18 congested primary unit. Round robin means the first event is assigned to primary unit 1 10 for 

19 processing, the second event is assigned to primary unit 210, the third unit is assigned to primary 

20 unit 310, the fourth event is assigned to primary unit 1 10, and so on. The present invention will 
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; l describe a distributing method in accordance with the distributed algorithm processing unit 600 

.2 as an example. 

3 [0027] Referring to FIG. 2B, suppose that a primary unit 110 is assigned to process certain 

4 events determined by the distributed algorithm processing unit 600. Primary unit 1 1 0 receives 

5 the certain assigned events from event generating unit 800 to occupy a necessary logical shared 

6 resource unit 700, and proceeds to process the events. Also, primary unit 1 10 informs a backup 

7 unit 220 of a sub-system 200 to maintain data needed to reconstruct and replicate the primary 

8 unit 110. Because duplication of all the data of primary unit 110 into backup unit 220 is 

9 laborious and not necessary, only a minimum data needed for restoration (e.g., an index for the 

10 events, an ongoing status, and occupied resources) is maintained in backup unit 220. When a 

11 new event is generated, it is distributed to the primary unit 210 and primary unit 310 by 

12 distributed algorithm processing unit 600. Also backup unit 320 and backup unit 120 keep a 

13 minimal amount of data necessary for restoration in an event that primary units 210 or 310 

14 should fail. 

15 [0028] Turning to FIGS. 3 A and 3B, FIGS. 3 A and 3B illustrate a distributed control system 70 

16 having three primary units and three backup units when one of the three primary units fails or 

17 malfunctions. When primary unit 110 fails, backup unit 220 takes over the processing of events 

18 formerly processed by primary unit 110. Therefore, backup unit 220 becomes a primary unit 

19 110'. FIG. 3B illustrates a block diagram illustrating load sharing processes of remaining 

20 primary units 210 and 310 when primary unit 110 malfunctions according to the same relations 

21 illustrated in FIG. 2B. 
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;1 [0029] As illustrated in FIG. 3 A, when a primary unit 110 fails, backup unit 220 becomes 

. 2 primary unit 1 10 ! by a distributed control environment 400 and CM unit 500. At this time, the 

3 primary unit 110' continues processing events previously handled by primary unit 110. Primary 

4 unit 110' does not process any new events as primary 1 10 f simply restores resources formally 

5 found in primary 110. If further events do not need to be processed, the function of the primary 

6 unit 1 1 0' terminates. 

7 [0030] When a new event is generated, like illustrated in FIG. 3B, the CM unit 500 and the 

8 distributed control environment 400 designates primary units 210 and 310 to process the events 

9 and take over the processing load originally destined for primary unit 1 10. Even though primary 

10 unit 210 and primary unit 110' are both disposed in the same sub- system 200, the primary unit 

11 110' is used only to process events originally designated to be processed by primary 110. 

12 Primary 1 10' is does not participate in the load sharing with primary units 210 and 310. Thus, in 

13 the system 70 of FIG. 3B, if primary unit 110 fails and backup unit 220 becomes primary unit 

14 110' which serves to replicate primary unit 110, primary unit 110 and primary unit 1 10 f do not 

15 participate in the load sharing with primary units 210 and 310. Therefore, newly generated 

16 events generated when primary unit 1 10 is not working are processed only by primary units 210 

17 and 310. When primary unit 110 should later be restored and become functional, primary unit 

18 110' reverts back to becoming backup unit 220 and primary unit 110, having been restored, 

1 9 serves to load share with primary units 210 and 3 1 0. 

20 [0031] Since the primary units 110, 210 and 310 perform load sharing processing according to 

21 the above process, the events can be processed uniformly among the three primary units where 
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;1 each of the three primary units shares the load in processing the events. As a result, there is no 

. 2 waste of resources. Moreover, it is possible to load share while providing a stable environment 

3 should one of the primary units should happen to malfunction. 

4 [0032] It is to be understood that changes and modifications to the embodiments described above 

5 will be apparent to those skilled in the art, and are contemplated. It is therefore intended that the 

6 foregoing detailed description be regarded as illustrative rather than limiting, and that it be 

7 understood that it is the following claims, including all equivalents, that are intended to define 

8 the spirit and scope of this invention. 

9 [0033] Like described so far, the present invention obtains advantages only separately supplied 

10 from a primary^ackup structure and a load sharing structure, and the advantages are as follows: 

11 1) there is no waste of resources since all existing systems are used, 2) it (the combined system) 

12 has good expandability by easily applying necessary systems, 3) it secures reliability with 

13 excellent economical efficiency, since maximum capacity can be obtained with a minimum 

14 system, 4) it can keep a data sharing function, an advantage of a distributed system as it is and 5) 

15 it keeps a primary/backup structure even with one system. 

16 [0034] Summarizing the above advantages, the present invention processes generated events by 

17 distributing load and uses all available systems, thereby improving system capacity without a 

18 waste of resources as well as stably and efficiently operating the system by duplicating data with 

19 the use of a backup system. 

20 [0035] While this invention has been particularly illustrated and described with reference to 

21 preferred embodiments thereof, it will be understood by those skilled in the art that various 
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1 changes in form and details may be made therein without departing from the spirit and scope of 
.2 the invention as defined by the appended claims. The preferred embodiments should be 

3 considered in descriptive sense only and not for purposes of limitation. Therefore, the scope of 

4 the invention is defined by the appended claims. 
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